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Abstract. We present a definition of untyped A-terms using a hetero- 
geneous datatype, i.e. an inductively defined operator. This operator 
can be extended to a Kleisli triple, which is a concise way to verify the 
substitution laws for A-calculus. We also observe that repetitions in the 
definition of the monad as well as in the proofs can be avoided by using 
well-founded recursion and induction instead of structural induction. We 
extend the construction to the simply typed A-calculus using dependent 
types, and show that this is an instance of a generalization of Kleisli 
triples. The proofs for the untyped case have been checked using the 
LEGO system. 
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1 Introduction 

The metatheory of substitution for A-calculi is interesting maybe because it 
seems intuitively obvious but becomes quite intricate if we take a closer look. 
[Hue92] states seven formal properties of substitution which are then used to 
prove a general substitution theorem. When formalizing the proof of strong nor- 
malisation for System F [Alt93b,Alt93a] the first author formally verified five 
substitution properties quite similar to those of [Hue92]. 

Therefore it seems a good idea to look for a more general and elegant way to 
state and verify the substitution laws. Obviously, this is also related to the way 
lambda terms are presented. 

We find a partial answer in the work of Bellegarde and Hook [BH94] who 
take the view that lambda terms should be represented by an operator Lam € 
Set -» Set, where Set denotes the universe of sets, such that Lam(X) is the set 
of A-terms with variables in X. This corresponds to the presentation of terms in 
universal algebra as an operator Term e Set — >■ Set. The substitution laws are 
captured by verifying that Lam can be extended to a monad or equivalently to 
a Kleisli triple (cf. Section 2.1, see also [Man76,Mog91]). 

In this paper we are going to revise and extend the work of Bellegarde and 
Hook in the following ways: 



— The presentation of Lam, see Section 3.2, is improved by using a heteroge- 
neous datatype 1 , i.e. there are no meaningless terms in our representation. 
Heterogeneous datatypes have already been discussed in [BM98], where they 
are called nested datatypes and modelled by initial algebras in functor cat- 
egories, which seems unsatisfactory. Building on this approach, in [BP99] 
heterogeneous definitions of untyped A-terms are investigated. 

— Repetitions in the definition of the monad and in the verification can be 
avoided by using well founded recursion (along a primitive recursive well- 
ordering) instead of structural recursion, see section 4. 

— The development has been verified using the LEGO system, see section 4.5. 

— We also extend this approach to the simply typed A-calculus, see Section 5. 
To do this we present a generalization of Kleisli triples, which we call Kleisli 
structures, see 5.1. 

— We analyze the type of inductive definitions needed in every step of the 
formalization using initial algebras of functors. We consider two generaliza- 
tions of the usual scheme of inductive definitions: heterogeneous (see 3.1) 
and dependent inductive definitions (see Section 5.2). 

Our work seems to be closely related to recent work by Fiore, Plotkin and 
Turi [FPT99] who pursue a more abstract algebraic treatment of signatures with 
binders but do not cover the simply typed case. Higher order syntax can also be 
used to represent A-terms, i.e. in [Hof99]. 

2 Preliminaries 

As a metatheory we use an informal version of extensional Type Theory, details 
can be found in [Mar84,Hof97]. Since we do not exploit the proposition-as-types 
principle we work in a system quite close to conventional intuitionistic set theory. 
We use Set and Prop to denote the types of sets and propositions. 

Notationally, we adopt the following conventions: We write the type of im- 
plicit parameters of dependent functions as subscripts, i.e. 17„GNatFin(n) — i > Set 
is a type of functions whose first argument is usually omitted. The hidden argu- 
ment can be made explicit by putting it in subscript, i.e. we write e.g. fx £ T(X) 
when we mean / £ IIxecT(X) for some type C £ Set obvious from the con- 
text. Given P, Q £ A — >■ Prop we write P C Q for Va £ A.P(a) — > Q(a). Given 
a curried function f £ A\ — > A^ — > . . . — > A n we write the application to a 
sequence of arguments a±, 02 , . . . , a n as f(a±, 02 , . . . , a n ). The same convention 
holds for 17-types. 

The rest of this section briefly reviews Kleisli triples, initial algebras, and 
inductive datatypes and might be skipped by the experienced reader. 


2.1 Kleisli triples 

We present monads as Kleisli triples, i.e. 

1 It seems that the idea for this presentation goes back to Hook, but he didn’t use it 
in the paper because it cannot be implemented in SML. 



Definition 1. A Kleisli- Triple (T, r? T ,bind T ) on a category C is given by 

— an function on the objects: T £ |C| -* |C| 

— a family of morphisms indexed by objects X £ |C|: r, T x GC(X,T(X)) 

— a family of functions indexed by X. Y £ |C|: 

bindj y € C (X,T(Y)) -> C(T(X),T(Y)) 

which are subject to the following equations: 

1 . bmdx tX (Vx) = 1 t(x) 

2. bind x,y{f) ° Vx = f where / £ C(X,T(Y)). 

3. bind J z (bind y Z (/) o g) = bindy Z (/) o bind5,y(s) 
where / € C(r,T(Z)), fl £ C(X,T(y)). 

Kleisli triples were introduced in [Man76], where they are also shown to be 
equivalent to the conventional presentation of monads, see [ML71], pp.133. 


2.2 Initial algebras 

Definition 2. For any endofunctor T : C — > C an initial T-algebra (ji l . c T , It T ) 
is given by 


— an object fx T € |C| 

— a morphism c T € C(T(/i T ), n T ) 

— a family of functions indexed by X £ |C|: ItJ € C (T(X),X) — > C(/u T ,X) 


such that given a T-algebra / € C(T(X), X): 


t(m t ) 

T(It T x (f)) 

T(X) 



f 


It T x(f) 

X 


commutes and It x(/) is the unique morphism with this property, i.e. given any 
h € C (fj, T ,X) we have h = lt x (/)■ 

fi T is called weakly initial if It5(/) exists but is not necessarily unique. 


We assume that our ambient category Set is bicartesian closed, i.e. has finite 

products 1,— x — , coproducts 0, — I — and function spaces > — . We say 

that a variable appears strictly positive in a type, if it appears never on the left 
hand side of an arrow type, and positive, if it appears only on the left hand side 
of an even number of nested arrow types. 


2.3 Inductive datatypes 

To model inductive types we introduce the concept of a strictly positive operator, 
i.e. a function T £ Set — ► Set which is given by a definition T(X) = a(X) such 
that X appears strictly positive in a(X). Here we write a(X) for a syntactic type 



expression in which the variable A may occur. Every strictly positive operator 
gives rise to an endofunctor on Set. 

Given a strictly positive operator T we introduce /i T = /iX.T(X) £ Set to 
denote the initial T-algebra. We extend strictly positive to /r-types, s.t. yu-types 
can be used to define new operators. We say that Set has all strictly positive 2 
datatypes if all initial algebras defined by a strictly positive operator exist. This 
gives rise to a A-calculus A #t , e.g. see [Alt98] . 

Examples for inductive datatypes are natural numbers Nat = /rA.l + X, 
ordinal notations Ord = fxX.l + X + (Nat — > A") or finitely branching trees 
Tree = fiX./jY.l + X xY. 

Datatypes can be conveniently presented by constructors and their types, i.e. 
Nat £ Set can be presented as 0 € Nat and succ £ Nat -* Nat, analogously 
Ord £ Set is given by O' £ Nat, succ' € Nat -> Nat and lim £ (Nat -> Ord) -» 
Ord. Nested types like Tree correspond to simultaneous inductive definitions, 
i.e. Tree, Forest £ Set is given by nil £ Forest, cons £ Tree — > Forest — > Forest, 
and span £ Forest — > Tree. 

Parametrized datatypes like lists can be defined as a function List £ Set — > 
Set given by List (A) = fiY.l+Y x A. List is homogeneous because the parameter 
A does not change in the inductive definition. 

Assuming weak initiality, the uniqueness property can be alternatively ex- 
pressed by an induction principle, i.e. given a predicate P £ fi T — > Prop we 
have C T (P) g p 


■ Dat — Ind 


Vx £ n T ,P(x) 
where c T (P) = {c T (x) | x £ P}. 

It is well known that all positive inductive types can be encoded impredica- 
tively (i.e. in System F, [GLT89]): 

/iX.T(X) = IIX £ Set .(T(X) — >■ X) — >■ X £ Set 

It T = AA £ Set. A/ £ T(X) -> X.Xx £ pX.T(X). x X f 
£ nX £ Set ,(T(X) -»■ X) -»■ p T ->■ A 

X.f(T(It T X /) x) 


= Ax £ T(fj,X.T(X)).XX £ Set. A/ £ T( X) 

T 


£ nf) 


Here T (— ) £ IIx,YeSet(X -> Y) -> T( A) — > T(Y) is the morphism part 
of the functor which can be derived from the fact that it is given by a posi- 
tive definition. This encoding is weakly initial, uniqueness can be derived from 
parametricity [Wad89,AP93]. 


3 A-terms as a heterogeneous datatype 

3.1 Heterogeneous inductive datatypes 

We interpret heterogeneous datatypes by initial algebras in the category of fami- 
lies of sets Fam. Objects in Fam are families F £ Set — > Set and given families 

2 Strictly positive can be replaced by positive , but it is not obvious whether this ex- 
tension is still predicative. 



F,G £ |Fam| morphisms are families of functions / £ II X £ Set. F(X') — >■ 
G(A r ). A strictly positive operator on families is a function H £ (Set -> Set) -> 
(Set — » Set) which is given by a definition H(F) = XX £ Set .a(F,X) where 
F appears only strictly positive in a(F, X). Every strictly positive operator on 
families gives rise to an endofunctor on Fam. 

Given a strictly positive operator H on families there exists an initial algebra 
fi H = pF.XX £ Set. H(F, X) £ Set -» Set. As before we define operators 
and inductive types simultaneously. The constructors c H and lt H now refer to 
morphisms in Fam — this can be spelt out as follows: 

c H £ n xe setH(n H ,X) -> v H (X) 

it H £ n FeSet ^ Set (n xeSet H(F,x) -> f(x)) -> n xeSet v H (x) -»■ f(x) 

The uniqueness property of the inductively defined operator can be also 
expressed by the following induction principle: Assume a family of predicates 
p e n XeSet ii H (X) -> Prop: 

VT e Set. c h (P y ) c Py 
— Het — Ind 

VF e Set.Vx € h h {Y).P y {x) 

The A-calculus corresponding to heterogeneous polymorphic definitions has 
to our knowledge not yet been explored. 

Positive heterogeneous inductive types can be encoded impredicatively (i.e. 
in System F“): 

=\ Y £ Set. IIF £ Set Set.(IIxeSeitH(F,X) ->■ F(X)) -* F(Y) 

£ Set — > Set 

lt H — \F £ Set Set. A / £ IIxesetH(F, X) -> F(X). XX £ Set. 

Xx £ . x(F. /) 

£ IIF £ Set -t Set. (JTxgsetT(F, X) -> F(X)) 

-> n xeSe tixF.H(F, X) -t F(X) 
c 11 — XX £ Set. Ax £ T(h h ,X). X F £ Set -» Set. 

A / £ II xeSet H(F,X) -+ F(X).f(H(It H ,F,f),x) 

£ n X es e tH( f j. H ,X)^ l j. H (X) 


3.2 Definition of Lam 

An example for a heterogeneous inductive datatype is the operator Lam £ Set — >■ 
Set from the introduction which can be defined as 

Lam = p.F £ Set -> Set.AX e Set .A + (F(X) x F(X)) + F(X ± ) 

where X± = 1 + X with two constructors new £ FIX £ Set. Ax and old £ 
Fixe SetA -» Ax and eliminator case £ II X} YeSetY — > (A — > Y) — > Ax — > Y. 
Clearly (_) ± gives rise to a functor. 



As before we can present inductively defined operators by giving the con- 
structors, which in the case of Lam read as follows: 

var e IIxESetX -» Lam(A) 

app € i7x6SetLam(X) — > Lam(A r ) — » Lam(A r ) 

abst € 77xeSetLam(A r j_) — >■ Lam(A) 

4 Lam is monadic 

To show that Lam has the structure of a Kleisli triple we first have to define rf^ m 
and bind^ a ™ • The former is simply varx and the latter can be defined recursively 
or structural inductively which gives rise to two different constructions. 


4.1 The recursive construction 

In this case bind and an auxiliary map lift 

lift e n x ,YeSet(X -> Lam(Y)) -s- A ± ->• Lam(Yj_) 
bind G IIx,YeSet(X — > Lam(Y)) — > Lam(A) — > Lam(Y) 

are defined by simultaneous recursion. The equations defining lift and bind re- 
cursively are given below. 

lift(/, new(A)) = var (new (Y)) 
lift(/, old(x)) = bind(var o old, f(x)) 
bind(/, var(x)) = f{x) 

bind(/,app(s,t)) = app (bind (/, s), bind (/, t)) 
bind(/, abst(t)) = abst (bind (lift (/), t)) 

We must first prove that bind is terminating. 

Definition 3. Let / € A — > Lam (B) for arbitrary A,B(E Set then let isVar(/) <S> 
3h : A -> B. f = var^ o h and 


fO if 
( 1 otherwise 


isVar(/) 


Now we are in a position to define a termination order for bind. For any 
recursive call bind(/',f') inside of bind (f,t) we must have ( f',t ') <b ( f,t ). To 
that end we define 


(/, t) < b (. f',t ') O v(f) < v(f') V (v(f) = v(f') A t < s t') 


where < s is the structural order on terms. As <6 is the lexicographic order on 
two well-founded orders we immediately get the following observation. 



Proposition 4. The order <b is well-founded. 

For the termination of bind the fact below is important. 

Proposition 5. For any f of appropriate type it holds that u(lift(/)) < v(f). 

Proof. Assume that v(f) =0 hence / = var o h. By case analysis it is easily 
verified that lift(/) = var ocase(new, oldoft,), hence u(lift(/)) = 0. Thus, we have 
shown that u(lift(/)) < v(f). 

Proposition 6. bind is a terminating function. 

Proof. The only difficult case is bind(/, abst(f)) = abst (bind (lift (/), t)). Since 
u(lift(/)) < v(f) and t < s abst(t) we get that (lift(/),t) <b (/, abst(t)). 

Condition 1. of Definition 1 holds by definition of bind. 

Proposition 7. Condition 2. of Definition 1 holds, i.e. 

Vt € Lam(X). bind(varx, t) = t. 

Proof. Proof by structural induction on t: The var , y - case is trivial. Assume that 
t = app (a, b) and bind(varx, a) = a and bind(varx, b) = b. Thus we obtain 

bind(varx,app(a6)) = app(bind(varx,a),bind(varx,&)) = app (a, b). 

Finally, assume that t = abst(s) and that bind(varx ± ,s)=s. Then 

bind (varx, abst (s)) = abst (bind (lift (varx), s)) = abst (bind (varx ± , s)) 

= abst(s) = t by induction hypothesis. 

Proposition 8. Condition 3. of Definition 1 holds: 

V/ € A Lam (B).Vg Lam(C). bind(<?) o bind(/) = bind(bind(<?) o /) 

Proof. Using extensionality and well-founded induction this amounts to prove 
three cases: The var and the app-cases are again easy. We concentrate on the 
abst-case. 

(bind(g) o bind(/)) (abst(f)) = bind(<?, bind(/, abst(f))) 

= bind(<?, abst(bind(lift(/), t))) 

= abst (bind (lift (g), bind (lift (/), t ))) 

= abst (bind (lift (g) o bind(lift(/)), t)) (ind.hyp.) 
= abst (bind (bind (lift (g) o lift (f)),t)). 

On the other hand bind(bind(g) o /, abst(t)) = abst(bind(lift(bind( 5 ) o f),t)) 
such that it remains to show 

lift (bind (g) ° /) = bind (lift (g) ) o lift(/) 

which is proved by extensionality and case analysis on the argument. First if the 
argument is a “new” variable then by definition of lift and bind: 

bind(lift(<?) o lift(/), new(A)) = lift(bind(g) o /, new(A)) 

In the other case we first distinguish whether isVar(/) holds or not: 



1 . Case: isVar(/): 

Then there is an ft £ A — > B such that / = var^ ° ft. 

lift (bind (g) o (var^ o ft)) o oldyi 
= lift o ft) o olcU (Def. bind) 

= bind(var£j_ o olds) o g oh (Def. lift) 

= lift (g) o olds o ft (Def. bind) 

= bind (lift (g)) o var^j. ° olds o ft (Def. bind reverse) 

= bind(lift(</)) o bind(vare ± ° olds) o v&ib o ft (Def. bind reverse) 

= bind(lift(<7)) ° lift(vars ° ft) ° old^i (Def. lift reverse) 

2. Case: -dsVar(/): 

lift (bind (p) ° /) ° old = bind(varc ± ° oldc) ° bind(g) o / (*) 

= bind(lift(<?)) o bind(varg ° olds) o f 
= bind(lift(g)) o lift(/) o oldyi 

For (*) it remains to show that 

bind(var<7 ± ° oldc) ° bind(g) = bind (lift (g)) o bind(vars ± o olds) 
which is proved below 

bind(varc ± ° oldc) ° bind (g) 

= bind (bind (varc ± ° oldc) ° g) (ind.hyp.) 

= bind (lift (g) o olds) 

= bind (bind (lift (g)) o vaiB ± ° olds) (Def. bind & ext.) 

= bind (lift (g ) ) o bind(var£ ± o olds) (ind.hyp.) 

The induction hypothesis is used three times. As we do not use structural in- 
duction we must give a termination order <' such that when proving 

(bind(a) o bind(/))(f) = bind(bind(5) o f.t) 

we use the induction hypothesis 

(bind(s') o bind(/'))(f') = bind(bind( 5 ') o /', t') 

only if ( ') <' (f,g,t) for an appropriate well-founded order <'. We define 
this order as follows 

(. f',g',t ') <' (f,g,t) <£>(/ = f /\g = g'At' < s t)v(v(f')+v(g') < v(f)+v(g)) . 

For the first application of the hypotheses the condition (/, g, t) <’ (/, g, abst(t)) 
holds by the structural order on the last argument. For the second we have to 
show (g, varc ± ° oldc, s) <' (/, g, s ) in case -dsVar(/) holds. As isVar(varc ± ° k) 
holds for any k, 0 = u(var c x ° oldc) < v(f) = 1, hence v(g) + u(varc ± ° oldc) < 
v(f) + v (g). The proof of the third case, (varc ± ° oldc, lift (5), s) <' ( f,g,s ), 
under the assumption -iisVar(/), is similar. 



One might argue that the proof is not constructive as we do a case analysis 
on the undecidable predicate isVar(/). However, we can instead introduce an 
additional precondition (isVar (/) V True) A (isVar(g) V True) where True corre- 
sponds to don’t know. We do case analysis over the disjunctions. When using a 
recursive hypothesis with / = varo/i we prove the precondition by a left injection 
(the same for g). 

We summarize the result: 

Corollary 9. (Lam(— ), var, bind) is a Kleisli triple. 


4.2 The construction by structural induction 

There is also a proof by structural induction. In this case we define bind and lift 
and also Lam(— ) the morphism part of the functor: 

Lam e IIx,y eSet(X — > Y) — » Lam(A r ) — > Lam(T) 

lift(/, new(X)) = var (new (F)) 
lift(/, old(x)) = Lam(old, f(x)) 

Lam(/, var(x)) = var (/(&)) 

Lam(/,app(s,f)) = app(Lam(/, s), Lam(/, t)) 

Lam(/, abst(t)) = abst(Lam(/j_, t)) 

bind(/, var(x)) = f(x) 

bind(/,app(s,f)) = app (bind (/, s), bind (/, t)) 
bind(/, abst(f)) = abst(bind(lift(/), t)) 

Note that bind is defined as in the recursive case, but now lift is not defined in 
terms of bind so all definitions are structural inductive. 

Additional to the propositions shown above, one also needs to show that Lam 
and (—)_!_ are functorial. 

Note that here Lam(h) takes the part of bind(var o h ) and thus the proof of 
(*) can be done by structural induction showing first the following two special 
instances of the third monad law: 

V/ € B — > C.Vg € A — > Lam(H). Lam(/) o bind((?) = bind(Lam(/) o g ) 
V/eB4 Lam (C).Vg € A -> B. bind(/) o Lam(^) = bind(/ o g) 

By combining those one immediately gets 

Vg € A — > Lam(H). bind (lift ( 5 )) o Lam(old y i) = Lamfolds) o bind(^) 

and from this one can easily derive (*) in the proof of Proposition 7 


lift(bind(<?) ° /) = bind (lift (g)) o lift(/) . 



The LEGO-code of the structural inductive and the general recursive proof is 
interesting in the sense that the latter version is only of half the size of the former 
- without the termination proof though. This emphasizes the significance of type 
theory with general recursion as long as termination can be ensured externally 
(possibly syntactically) . 

4.3 Substitution 

Once we have bind Lam and 7? Lam we can define a substitution operator on Lam- 
terms subst £ 77 J 4 e setLam(ylj_) -* Lam(A) — > Lam(A) as follows 

substyi(t, s) = bind(case(s, var^), t) 

The weakening weak € H( 4 e setLam(A) — » Lam(Aj_) can be written 

weak^ = bind ( var yi ± o oldyi) 

That substitution and weakening have the right properties follows immediately 
from the Kleisli properties for bind and var. As an example we show how to 
derive subst (weak(f), u) = t: 

subst(weak(t), u) = bind(case(w, var), bind(var o old, t)) 

= bind(bind(case(w, var), var o old), t) (3.) 

= bind(case(u, var) o old, t) (2.) 

= bind ( var, t) 

= t (1.) 

The numbers refer to the equations in Definition 1. 


4.4 Implementations in Haskell and SML 

Heteregeneous datatypes like Lam can be easily implemented in a functional 
language like Haskell [HJW+92]. The implementation below by Sven Panne also 
exploits predefined typeclasses like Monad and Functor in Haskell (where >>=, 
return, Maybe, Just, Nothing, maybe denote bind, r], (— )j_, old, new, and case, 
respectively). 

data Lam a = Var a 

| App (Lam a) (Lam a) 

I Abs (Lam (Maybe a) ) 

instance Functor Lam where 

fmap f x = x >>= return . f 

instance Monad Lam where 
return = Var 
Var x »= f = 

App t u >>= f = 

Abs t >>= f = 


f x 

App (t »= f) (u »= f) 
Abs (t »= lift f) 



lift : : (Monad b. Functor b) => (a -> b c) 

-> Maybe a -> b (Maybe c) 
lift f Nothing = return Nothing 
lift f (Just x) = fmap Just (f x) 

subst : : Monad a => a (Maybe b) -> a b -> a b 
subst t u = t >>= maybe u return 

Although the datatype Lam is definable in ML [HMM86], lift is not ac- 
cepted by the ML type system. The reason is that lift requires polymorphic 
recursion, which is known to be undecidable. The Haskell type system is more 
flexible because it does not try to infer the type of function if it is given anyway. 
There is also an implementation of an improved ML typechecker [EL99] which 
implements polymorphic recursion via a semialgorithm for semiunification. The 
corresponding ML-code reads as follows: 

datatype ’a Lift = new I old of J a; 

datatype ; a Lam = var of 1 a | app of (’a Lam)*( ; a Lam) 

I abs of (’a Lift) Lam; 

fun bind f (var x) = f x 

I bind f (app (t,u)) = app (bind f t,bind f u) 

| bind f (abs t) = abs (bind (lift f) t) 
and lift f new = var new 

| lift f (old x) = lam old (f x) 
and lam f = bind (var o f) ; 

fun subst t u = bind (fn new => u I old x => var x) t; 
fun weak t = lam old t; 

4.5 Implementation in LEGO 

Using the Inductive-statement such a heterogeneous datatype can be defined 
in LEGO [LP92] as follows: 

Inductive [Lambda: Set->Type] ElimOver Type 
Constructors [var : {X | Set}X->Lambda X] 

[app : {XlSet} (Lambda X)-> (Lambda X)-> (Lambda X)] 

[abst: {X|Set} (Lambda (Lift X)) -> (Lambda X)]; 

In the formalization we assume a constant ext which makes the propositional 
equality extensional and thus destroys the computational adequacy of Type The- 
ory. This problem could be overcome by moving to a Type Theory as described 
in [Alt99] . The complete LEGO code (for both variants) can be found in [RA99]. 

5 Extension to simple types 

5.1 Kleisli structures 

To capture the case of typed algebras, specifically the simply typed A-calculus, we 
introduce a generalization of the Kleisli-triples, which we call Kleisli structure: 



Definition 10. A Kleisli structure (/, F, G, r] F,G , bind F,G ) on a category C is 
given by 


— an index set / G Set 

— families of objects indexed by I F, G G I — > |C| 

— a family of morphisms indexed by i € I: r]f’ G G C(F(i), G(i)) 

— a family of functions indexed by i, j G I: 

bind ff G C (F(i),G(j)) -> C (G(t),G(j)) 
which are subject to the following equations: 

1. bind = l G(j) 

2. bind£ G (/) o n?’ G = f where / G C (F(i), G(j)). 

3 - h[nd ff ( bind ff (/) °9)= bind^'f (/) o bind^f (« g ) 

where / G C (F(j),G(k)),g G C (F(i),G(j)). 

Kleisli triples are a special case of Kleisli structures where I = |C| and F is 
the identity. Writing Cp for the category whose objects are elements of I and 
C F(i,j) = C (F(i),F(j)) we obtain a functor T : C f Cq which is given by 
the identity on objects and on morphisms / G C by 

T(/)=bii<f(^f G o/) 

In the special case of Kleisli triples this is the endofunctor on C given in section 
2.1. Since T is not an endofunctor in general it cannot be a monad. 


5.2 Dependent inductive types 

Next we model dependent inductive types, which are also called inductive fam- 
ilies, by initial algebras in categories of families [Dyb94]. Given an index type 
I G Set, we define the category of /-indexed families: objects are F G / — » Set 
and morphisms are /-indexed families of functions / G //j G /F(i) — > G(i). An in- 
ductively defined dependent type is an initial algebra in the category of /-indexed 
families. 

We assume that Set is also closed under II- types, A-types and Equality types 
Eq G IlAesetA -> A — » Set, where A G Set. We use the usual A-notation for II- 
types. Elements of A-types are given by pairs, i.e. given A G Set, B G A — > Set, 
if a G A and b G B(a) then (a,b) G Sa G A. B(a). The only inhabitant of an 
equality type is refl G //^gset/Za € A.Eq(a, a). We assume that the equality 
type is extensional, i.e. a = b holds iff Eq j4 (o, b) is inhabited. For details see e.g. 
[Mar84]. 

We define a strictly positive operator on families as a function G G (/ — > 
Set) —»/—>- Set which is given by a definition G(F) = Xi G I.a(F,i ) where 
F appears only strictly positive in a(F,i). Every strictly positive operator gives 
rise to a functor on the category of /-indexed families. 



Given a strictly positive operator G we introduce 


H G = nF £ / -> Set. At £ I.G(F, i) £ J -> Set 

to denote the initial G-Algebra. As before we define strictly positive operators 
simultaneously with dependent yu-types such that n can be used in the definition 
of new operators. We spell out the types of the constructor and iterator: 

c G £ Ili £ I.G(n°,i) — > n G (i) 

lt G £ HF £ I -> Set. (Hi e I-G(F, i) ->• F{i )) -> Hi e J./x G (i) -> F(i) 


It is convenient to present dependent inductive types by giving the con- 
structors. As an example consider the type of finite sets: Fin £ Nat — > Set, 
OFin € Fn £ Nat. Fin(succ(n)), succFin € 77„ G NatFin(n) — > Fin(succ(n)). This 
definition can be mechanically translated into the strictly positive operator 


GFin(F £ Nat — > Set) = A n £ Nat. Am £ Nat.Eq(succ(m), n) x (1 +F(m)). 

The type of c G is isomorphic to the product of the types of OFin and succFin- In- 
ductive dependent types which are indexed over several sets, like Ila £ A.B(a) — > 
Set correspond to yu-types whose index set is a A-type, i.e. Ao £ A.B(a). 

Inductively defined dependent types can be encoded in the calculus of con- 
structions along the same lines as heterogeneous datatypes, see section 3.1. 

As before we can represent the uniqueness condition by an induction princi- 
ple: Assume a family of predicates P £ IIi e in G {i) — > Prop: 


Vi £ I. c G (P(i)) C P(i) 
Vi e I.Vx £ /j, G (i).P(x ) 


Dep — Ind 


In Type Theory it is standard to use a dependent iterator which captures both 
induction and iteration. 

Heterogeneous datatypes as introduced previously can be seen as an instance 
of dependent inductive types if we assume the existence of a universe U £ Set 
which reflects all the type formers introduced so far. 


5.3 The definition of Lam for simple types 

To extend the previous construction to simply typed A-calculus we have to use 
dependent inductive types and Kleisli structures instead of triples. Given a set of 
types Ty, the base category C is the category of Ty-indexed sets, whose objects 
are families of sets indexed by types (F £ Ty — » Set) and the morphisms are 
type-indexed families of functions / £ Bt ae T y F(a) — > G{a). 

The index set I is given by the inductively defined set of contexts Con and 
the families involved are Var(P, a) - the set of variables of type a in context F 
- and Lam(P, a) - the set of terms of type a in context P. Var(P) and Lam(P) 
are objects in our base category for any P £ Con. 



We shall present the types involved by giving the constructors. The set of 
types Ty and contexts Con are given by the following homogeneous definitions: 
Ty £ Set, o € Ty, — =>■ — £ Ty -» Ty -» Ty, Con £ Set, empty £ Con, 
cons € Ty — > Con — > Con. Here cons corresponds to — j_ in the untyped case. 
Var is given by a dependently typed inductive definition: 

Var £ Con Ty — > Set 

old £ -ZTreCon-fTr £ Ty.l7 CTe T y Var(T, ct) — » Var (cons (t, T), ct) 
new £ Iir £ Con. ila £ Ty. Var ( cons (<j, T), ct) 

Similarly, Lam is given by a dependent inductive type: 

Lam £ Con — > Ty — > Set 
var £ ilreCon.aeTyV ai(r,a) -» Lam(T,a) 
app £ Lfrecon,< 7 ,reTyLam(T, a =>■ r) — > Lam(T, ct) — » Lam(T, t) 
abst £ i7reCon,cr,reTyLam(cons(CT, T),r) — » Lam(T, a => r) 

As in the untyped case var is the unit r? of our Kleisli structure. We now define 
bind and lift by simultaneous recursion: 

bind £ n rt A e con(n aeT yV&r(r, a) -» Lam (A, a)) -> 

IZ' £ r e TyLam(T, ct) -> Lam(Z\,CT) 
lift £ n r ,AeCo n nT £ Ty(iI freT y Var(T, ct) -> La m(A,(j)) -> 
7T 0 - e TyVar(cons(r, T), ct) — > Lam(cons(r, A), ct) 
lift (ct, /, new(T, ct)) = var(new(A, ct)) 
lift (ct, /, old(<7, x)) = bind(var o old (ct), /(x)) 

bind(/, var(x)) = /(x) 

bind(/, app(f, u)) = app(bind(/, t), bind ( 3 , t)) 
bind(/, abst(t)) = abst (bind (lift (ct, /), t)) 

The termination argument is the same as for the untyped case, see Section 4.1. 
5.4 Lam is a Kleisli structure 

The verification of this fact has the same structure as the previous proof but 
with different types. Let us state the result precisely: 

Theorem 11. Lam gives rise to a Kleisli structure where 

— C is the category of Ty -indexed families. 

— 1 = Con 

— F = Var £ Con -> |C| 

— G — Lam £ Con — > |C| 

— rjr = var r € C(Var(T), Lam(T)) 

— bind_r,ii € C(Var(T), Lam(A)) — > C(Lam(T), Lam(A)) 

Proof See the proofs of Corollary 9. 



6 Conclusions and open problems 


We have discussed a uniform representation of untyped and typed A-terms based 
on Kleisli triples in type theory using heterogeneous (generalized) datatypes. All 
this can be easily implemented in Haskell and in a special version of SML and 
formally verified in LEGO. The recursive construction of the Kleisli-triple turned 
out to be much simpler than the structural inductive one which emphasizes our 
point of view that recursive proofs are often easier and should be supported by 
modern type theoretical systems. It is future work to look for a generalization 
to terms of dependently typed A-calculi, thus suggesting a new approach for the 
project of Type Theory in Type Theory (cf. [MP93]). A problem which needs 
to be tackled in this context is that the type of the substitution function in a 
dependently typed context may depend on its own graph. 

Once having finished the examination of the Lam-monad and turning atten- 
tion to other examples of heterogeneous datatypes many interesting questions 
arise that deserve further investigation. There exist practically interesting ex- 
amples that need a stronger notion of inductively defined functors, not just 
operators. Moreover, can one find a useful characterisation of “being Kleisli” for 
inductive families? A challenging open question is whether inductively defined 
operators are proof-theoretically conservative with respect to standard induc- 
tive ones, i.e. can one define more functions on natural numbers using inductive 
operators? 
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